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Abstract 

Coniparative nnitochondrialgenonnics of arbuscularnnycorrhizalfurigi(AMF) provide new averiues to overconnelong-lastirig obstacles 
that have hampered studies ainned at understanding the connmunity structure, diversity, and evolution of these multinucleated and 
genetically polymorphic organisms. AMF mitochondrial (mt) genomes are homogeneous within isolates, and their intergenic regions 
harbor numerous mobile elements that have rapidly diverged, including homing endonuclease genes, small inverted repeats, and 
plasmid-related DNA polymerase genes (dpo), making them suitable targets for the development of reliable strain-specific markers. 
However, these elements may also lead to genome rearrangements through homologous recombination, although this has never 
previously been reported in this group of obligate symbiotic fungi. To investigate whether such rearrangements are present and 
caused by mobile elements in AMF, the mitochondrial genomes from two Glomeraceae members (i.e.. Glomus cerebriforme and 
G/omussp.) with substantial mtDNA synteny divergence, were sequenced and compared with available glomeromycotan mitochon- 
drial genomes. We used an extensive nucleotide/protein similarity network-based approach to investigate dpo diversity in AMF as well 
as in other organisms for which sequences are publicly available. We provide strong evidence of dpo-induced inter-haplotype 
recombination, leading to a reshuffled mitochondrial genome in Glomus sp. These findings raise questions as to whether AMF 
single spore cultivations artificially underestimate mtDNA genetic diversity. We assessed potential dpo dispersal mechanisms in AMF 
and inferred a robust phylogenetic relationship with plant mitochondrial plasmids. Along with other indirect evidence, our analyses 
indicate that members of the Glomeromycota phylum are potential donors of mitochondrial plasmids to plants. 

Key words: arbuscular mycorrhizal fungi, mitochondrial genome rearrangements, homologous recombination, plasmid-related 
DNA polymerase genes, short inverted repeats, nucleotide/protein similarity network. 



Introduction 

The association between arbuscular mycorrhizal fungi (AMF) 
and plant roots is one of the most widespread symbioses in- 
volving plants, and thus has an important role in terrestrial 
ecosystems (Smith and Read 2008). In exchange for carbohy- 
drates, AMF improve plant fitness by enhancing mineral nu- 
trient and water uptake (Ruiz-Lozano et al. 1995). AMF have 
also been shown to alleviate salt stress (Evelin et al. 2009) and 
to provide protection against root pathogens (St-Arnaud and 
Vujanovik 2007; Ismail and Hijri 2012). Despite the fact that 
these symbioses contribute to important services in natural, 
agricultural and reclaimed ecosystems (Gianinazzi etal. 2010), 



the species richness, community structure and functional di- 
versity of AMF is not well understood (Wehner et al. 2010) 
due to a lack of reliable molecular tools. The intra-isolate ge- 
netic diversity of nuclear DNA and RNA sequences observed in 
AMF (Sanders et al. 1995; Kuhn et al. 2001; Pawlowska and 
Taylor 2004; Mathimaran et al. 2008; Stockinger et al. 2009; 
Boon et al. 2010), has made it difficult to determine evolu- 
tionary relatedness at high levels of resolution (i.e., between 
genetically similar species and/or isolates). 

In contrast, the first mitochondrial (mt) genome sequenced 
from Glomus irregulare (synonym Rhizophagus irregularis; 
formerly G. intraradices) was shown to be homogeneous 
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(Lee and Young 2009), a state often referred to as homoplas- 
mic. Since then, the mitochondrial large subunit rRNA gene 
has been explored for its usefulness as a marker (Raab et al. 
2005; Borstler et al. 2008; Thiery et al. 2010), but determina- 
tion of its specificity at the isolate level is challenging due to a 
lack of genetic variability. Recently, the mitochondrial ge- 
nomes of six G. irregulare isolates (Formey et al. 2012) and 
a closely related species (Beaudet et al. 201 3) were compared. 
Striking features of these data sets were the presence and 
variability among isolates of numerous mobile elements such 
as plasmid-related DNA polymerase genes (dpo), short in- 
verted repeats (SIRs), and homing endonuclease genes, 
which have given rise to mitochondrial gene hybrids through 
horizontal gene transfer (HGT). Intra-specific divergence in the 
eroded portions of these elements revealed the high value of 
designing much-needed isolate-specific molecular markers 
(Corradi and Bonen 2012). 

Mitochondrial DNA has long been considered a valuable 
tool in evolutionary biology because of its high mutation rate 
and the assumption that individuals only possessed one 
mtDNA haplotype, directly maternally inherited, without re- 
combination (Birky et al. 1978; Birky 2001). However, reports 
of mitochondrial recombination events and heteroplasmy in 
animals, but especially in plants and fungi, led to re-evaluate 
this assertion (Barr et al. 2005; White et al. 2008). Molecular 
recombination is an important evolutionary mechanism that 
allows deleterious mutations removal and generates allelic di- 
versity in a population. The main causes of heteroplasmy are 
bi-parental inheritance and/or mutations, which result in the 
coexistence of mitochondrial genomes that differ in either 
nucleotide composition (site heteroplasmy) or length (length 
heteroplasmy). The latter can involve large-scale rearrange- 
ments or insertiorVloss of coding regions (Boursot et al. 
1987; Volz-Lingenhohl et al. 1992). In AMF, transient mito- 
chondrial length heteroplasmy through anastomosis has 
recently been demonstrated for geographically distant 
G. irregulare in vitro isolates (de la Providencia et al. 2013). 
The divergence in length between the two mtDNA haplotypes 
was caused by a variation in plasmid-related dpo insertions. 
The integration of elements such as dpo and SIRs has been 
shown to produce genomic rearrangements through recom- 
bination in other organisms (Schofield et al. 1992; Bi and Liu 
1996; Paquin et al. 2000; Ferandon et al. 2008; Tanaka et al. 
2012). 

Integrated plasmid-related dpo genes have been reported 
in all AMF mitochondrial genomes that have been sequenced 
thus far. Plasmids are self-replicating extra-chromosomal 
DNA molecules that were originally identified in bacteria. 
Analogous molecules have now been reported in eukaryotes, 
notably plants and numerous fungal species. The majority of 
plasmids described in filamentous fungi are strictly mitochon- 
drial, but cases of nuclear localization have been reported in 
yeasts (reviewed in Griffiths [1995]) and a Mucor-Wke fungus 
(Hanfler et al. 1992). Plasmid structure varies widely, from 



circular molecules to linear invertrons with terminal inverted 
repeats (TIRs). Both types harbor one or two open reading 
frames (ORFs) encoding DNA and/or RNA polymerase genes 
(Nargang et al. 1984; Akins et al. 1988; Pande et al. 1989; 
Sakaguchi 1990; Shiu-Shing Chan et al. 1991; Court and 
Bertrand 1992; Li and Nargang 1993). Plasmids have also 
been shown to integrate into fungal mtDNA through recom- 
bination and can either be cryptic or elicit changes in pheno- 
typic expression (Akins et al. 1986; Robison et al. 1991; 
Griffiths 1992; Hanfler et al. 1992; Oeser et al. 1993; 
Hermanns et al. 1994). Such insertions have been reported 
to be acquired vertically from a common ancestor in plants 
(Robison and Wolyn 2005) and also in fungi (Robison et al. 
1991). However, numerous cases of HGT of those elements 
have been reported in fungi, offering an explanation for their 
ubiquitous distribution in this group (Collins and Saville 1990; 
Griffiths et al. 1990; Kempken et al. 1992; Arganoza et al. 
1994; Debets et al. 1994). Furthermore, plant mitochondrial 
plasmids are suspected to have been acquired from a fungal 
donor based on their similar structure and low GC content, 
which is a common feature of fungal mitochondrial genomes 
(reviewed in Rosewitch and Kistler [2000]). 

Another type of mobile element found in the mtDNA of 
Glomeromycota, SIRs (Formey et al. 2012), were observed 
in the mitochondrial genome of the green algae 
Chlamydomonas reinhardtii (Boer and Gray 1991), but their 
structure is reminiscent of GC clusters in yeast (de Zamaroczy 
and Bernardi 1986) and the Pst I palindromes of Neurospora 
crassa (Yin et al. 1981). SIRs can be folded into hairpin sec- 
ondary structures, and have been suspected to have created 
genome rearrangements in Chlamydomonas spp. by intramo- 
lecular recombination (Denovan-Wnght and Lee 1994; 
Nedelcu and Lee 1998) through a model that is analogous 
to what has been observed in fungi (Almasan and Mishra 
1991; Weiller et al. 1991), plants (Andre et al. 1992) and 
bacteria (Schofield et al. 1992; Bi and Liu 1996). They are 
putatively mobile, although their transposition mechanism is 
not well understood (Grindley and Reed 1985; Nakazono et al. 
1994). Nevertheless, mobility has been demonstrated in sim- 
ilar repeats that fold into double hairpin structures in 
Allomyces spp. mtDNA (Paquin et al. 2000). 

No information about the underlying mechanisms and 
consequences of such integrations into the mtDNA of 
Glomeromycota is currently available. Understanding whether 
mitochondrial recombination occurs in AMF is important be- 
cause it can modify the interpretation of several aspects of 
research, including phylogenetic and population genetic anal- 
yses (Ballard and Whitlock 2004). To investigate this issue, we 
compared two novel mitochondrial genomes from the 
Glomeraceae family with a G. irregulare isolate. As all AMF 
mitochondrial genomes from the Glomeraceae family that 
have been sequenced so far share the same mitochondrial 
synteny, we used this as a comparative basis for our study. 
We assessed mitochondrial genome evolution with regards to 
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two other species of this ubiquitous AMF family, which 
showed divergence in mitochondrial gene organization. We 
wanted to know whether the new mtDNA reported here, 
from the species Glomus sp. DAOM-240422 and G. cerebri- 
forme DAOM-227022, could shed light on the impact of 
mobile element insertions with regards to genome organi- 
zation, and thus assessed the diversity, distribution, and 
propagation of mitochondrial dpo sequences within the 
Glomeromycota phylum. We also explored their evolutionary 
relatedness to other taxa, with respect to their global position- 
ing within all known plasmid-related dpo sequences in the 
public databases. 

Materials and Methods 

Fungal Material and DNA Extraction 

Spores and mycelium of G. irregulare (DAOM-234179), 
Glomus sp. (DAOM-240422), and G. cerebrlforme (DAOM- 
229022) were cultivated in vitro on a minimal (M) medium 
solidified with 0.4% (w/v) gellan gum (Gelzantm, Gelrite) in 
association with Ri-T transformed carrot roots following the 
protocol described by Becard and Fortin (1988). Plates were 
incubated in the dark in an inverted position at 25 °C, and after 
several weeks, abundant mycelia and spores were produced. 
Spores and mycelia were extracted from M medium by solu- 
bilization of the gellan gum in lOmM sodium acetate-citrate 
buffer (pH 6) (Doner and Becard 1991) and washed in sterile 
water. The resulting fungal material was further purified by 
hand under a binocular microscope to remove root fragments. 

Spores and mycelium were suspended in 400^1 of the 
DNeasy Plant Mini Kit API buffer (Qiagen) and crushed with 
a pestle in 1 .5 ml micro tubes, and the DNA was purified ac- 
cording to the manufacturer's recommendations. Purified 
DNA in a final elution volume of 40|iL was stored at 
-20 °C until use. 

Long Polymerase Chain Reactions 

The long polymerase chain reaction (PCR) protocol was per- 
formed using Takara LA Taq (Takara Bio, Canada) following 
the manufacturer's recommendations in a volume of 50^1 
containing 0.2 mM deoxynucleotide (dNTP) solution mix, 
1 .5 mM MgCl2, and 0.5 ^iM of each primer and 1 |il of tem- 
plate DNA. The primer pairs summarized in table 1 were used 
to validate the synteny in Glomus sp. DAOM-240422. Cycling 
parameters were 94°C/3 min, followed by 38 cycles of: 94 °C/ 
30 s, 54 °C/25 s, 68 °C/1 2 min, and a final elongation at 72 °C 
for 8 min. PCR products were separated by electrophoresis in a 
1 % (w/v) agarose gel and visualized with GelRed under ultra- 
violet light. 

Sequencing, Assembly, and Gene Annotation 

Glomus irregulare DAOM-234179, Glomus sp. DAOM- 
240422, and G. cerebrlforme total DNA was sequenced 



using 454 Titanium FLX shotgun technology and the resulting 
reads were assembled in one mitochondrial contig using 
Newbler (Genome Ouebec Innovation Center, McGill 
University, Montreal, Canada). Gene annotation was per- 
formed with the automated organellar annotation software 
MFannot (http://megasun.bch.umontreal.ca/cgi-bin/mfanno1/ 
mfannotlnterface.pl, last accessed August 23, 2013) (for 
more information on MFannot see Burger et al. [2013]), fol- 
lowed by manual inspection for the presence of frame shifts 
and the introduction of missing gene features. SIRs were also 
manually annotated using NCBI BLAST, and their secondary 
structure was predicted using the RNAfold web server 
(Gruber et al. 2008). The annotated sequences of the three 
new AMF mitochondrial genomes were deposited in GenBank 
under the accession numbers KC 164354, KC 164355, and 
KC 164356 for G. irregulare DAOM-234179, Glomus sp. 
DAOM-240422, and G. cerebrlforme, respectively. Circular 
and linear maps of the mitochondrial genomes (figs. 1 and 
2) were created using OGDraw v.1.1 (Lohse et al. 2007). 

Network Analyses 

Two similarity network analyses, where each node represents 
a dpo sequence and an edge represents a sequence similarity/ 
identity were performed using EGN (Halary et al. 2013). The 
first network was constructed from a data set of 91 
Glomeromycotan dpo sequences (from our collection and re- 
trieved from GenBank). All of the sequences were compared 
with each other using BLASTn (Altschul et al. 1 990) according 
to the following parameters: reciprocal best hit with a mini- 
mum of 1 E- 1 0 e-value, and 30% minimum identity covering 
at least 30% of the smallest sequence. The second and larger 
network includes an extensive sampling of all DNA polymer- 
ases from a broad range of taxa. This data set was built using 
translated Glomeromycota dpo sequences as queries for 
BLASTp searches on the Genbank nr database. After retrieving 
hits that showed a minimal e-value of 1 E-05, we performed a 
second BLASTp search to be confident that we had covered 
the largest possible diversity of DNA polymerase gene se- 
quences. The protein similarity network was calculated using 
BLASTp with a minimal e-value threshold of 1E-40, 20% 
minimal similarity covering at least 20% of the smallest se- 
quence. The network layouts were further produced by 
Cytoscape software, using an edge-weighted force-directed 
model, meaning that genes sharing more DNA identity/ 
protein similarity appear closer in the display. 

Phylogenetic Analyses 

Maximum likelihood phylogenetic analyses of AMF mitochon- 
drial genes were performed with a data set of seven mito- 
chondrial gene nucleotide sequences {atp6, atp9, cox2, nadi, 
nad2, nad4, and nad6) that were concatenated using DAMBE 
software version 5.2.13 (Xia and Xie 2001). The alignment 
was done using MUSCLE version 3.8.31. The analyses were 
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Table 1 

Long PCR Primers Used to Validate the Glomus sp. DAOI\/I-240422 
Mitochondrial Synteny 



mtDNA 


Primers 


Sequences (5 -3 ) 


Size (bp) 


region 










F 


OrGGCTCTATTCGCCTTAATGA 




atp9-coxl 


R 


ACCAGGAAGAAGATCATAACGA 


5,029 




F 


TGATAGGATTGATGGGTTTCATAG 




nad4L-ml 


R 


GAACATACTTAGCTTTGATGATGGT 


13,094 




F 


TACGGGTGTACAGCTCTATGAGT 




cox3-rns 


R 


TGGACTACGAGGGTATCTAATCCT 


13,610 




F 


TGCTAAAGATATTGGGACTCTCT 




coxl control 


R 


GTAATCTG GTATTCTTCG AG G CA 


8,785 



performed using the GTR + G model (with five distinct gamma 
categories), with 1 ,000 bootstrap replicates. Bayesian analysis 
was performed using MrBayes version 3.2 with the GTR + G 
model (with five distinct gamma categories), four independent 
chains, one million cycles, tree sampling every 10 generations 
and a burn-in value of 40% (supplementary fig. SI, 
Supplementary Material online), which was adequate for 
Markov chain convergence. 

The global dpo protein phylogeny (fig. 46) was performed 
with a data set extracted from the connected component of 
the protein similarity network containing fungal and plant se- 
quences. The MCODE Cytoscape plugin (Bader and Hogue 
2003) was used to extract the most connected cluster, that 
is, the cluster of nodes which connect the most sequences to 
each other. This process allows a sampling in which sequences 
will share a maximum similarity, improving the downstream 
phylogenetic inference. The selected protein sequences were 
locally aligned with COBALT version 2.01 (Papadopoulos and 
Agarwala 2007). The maximum likelihood phylogenetic anal- 
ysis was performed with RaxML using the rtREV model with 
the CAT option (PROTRTREVCAT) and 1 ,000 bootstrap repli- 
cates. This phylogenetic model is an amino acid substitution 
matrix that has been shown to be appropriate for the analyses 
of polymerase genes (Dimmic et al. 2002). Further, Bayesian 
inference was performed with MrBayes using the rtREV + CAT 
model, four independent chains, one million cycles, tree sam- 
pling every 1 0 generations, and a burn-in value of 40%, which 
was adequate for Markov chain convergence. 

The Glomeromycota dpo phylogeny (supplementary fig. 
S6, Supplementary Material online) was performed on a con- 
served domain of 75 amino acids that was present in 60 out 
of the 91 AMF dpo sequences. The sequence alignment was 
done with COBALT version 2.01. The maximum likelihood 
analysis was conducted with the rtREV -n G model (with five 
distinct gamma categories), with 1,000 bootstrap replicates. 
Bayesian inference was performed with MrBayes using the 
rtREV + G model, four independent chains, one million cycles, 
tree sampling every 10 generations, and a burn-in value of 
40%, which was adequate for Markov chain convergence. 



All three phylogenetic analyses were done using the inte- 
grative software TOPALi version 2.5 (Milne et al. 2004). The 
best phylogenetic model was determined using TOPALi ver- 
sion 2.5 model selection test based on the Akaike information 
criterion. The tree figures were completed using TreeGraph 
version 2.0.47 (Stover and Muller 2010). 

Recombination Analyses 

The recombination analyses were performed on the 
Glomeromycota nucleotide identity network group 1 , 2, and 
8 (where the Glomus sp. 240422 dpo inserted in the 
reshuffled intergenic regions clustered). To easily detect the 
presence of distinct recombination signatures, a distance- 
based recombination analyses (supplementary fig. S7, 
Supplementary Material online) was achieved with the recom- 
bination analyses tool (RAT) (Etherington et al. 2005), with the 
low threshold set to 70% nucleotide identity and the high 
threshold set to over 90% identity. To further confirm the 
occurrence of the putative recombination events, we tested 
their statistical significance with a phylogenetic method using 
the Hidden Markov Model (HMM) with the F84 + gaps nucle- 
otide substitution model (Husmeier and Wright 2001), per- 
formed with TOPALi v2.5 (default parameters). 

Results 

Mitochondrial Genome Description and Comparison 

The complete mtDNA sequences of G. irregulare (DAOM- 
2341 79), Glomus sp. (DAOM-240422), and G. cerebriforme 
(DAOM-229022) are double-stranded circular DNA mole- 
cules, are homogeneous within isolate with no DNA sequence 
polymorphisms, that is, genetic segregation of mtDNA in 
these strains appears to be as effective as in other published 
AMF genome. These mtDNAs harbor the typical set of 41 
mitochondrial genes reported so far in AMF (Lee and Young 
2009): two rRNAs, 14 protein-coding genes (PCGs) and 25 
tRNAs. The PCGs include three ATP synthetase (atp), one cy- 
tochrome b (cob), three cytochrome c oxydase (cox), and 
seven NADH dehydrogenase (nad) genes (fig. 1). The 
G. irregulare isolate DAOM-234179 mtDNA has a genome 
size of 75,075 bp with a GC content of 35.7%, which is 
within range of the sizes of other sequences reported for 
this species (68,995-87,754 bp with a ^37% average GC 
content) (Lee and Young 2009; Formey et al. 2012; Nadimi 
et al. 2012). The only true outlying G. irregulare isolate with 
respect to its mitochondrial genome was reported by Formey 
et al. (2012) (i.e., MUCL43204) and has a size of 87,754 bp, 
whereas its conserved mitochondrial genes show only 65.8% 
identity (including introns and ORFs) with G. irregulare isolate 
494. The Glomus sp. DAOM-240422 and G. cerebriforme 
mtDNAs were 86,170 and 59,633 bp, respectively, with GC 
contents of 36.7% and 46.7%. The latter is the smallest AMF 
mitochondrial genome yet reported, and is highly divergent 
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Frc. 1. — Comparison of Glomus irregulare DAOM-234179, Glomus sp. DAOM-240422, and G. cerebhforme mitochondrial genomes. Tlie circular- 
mapping genomes were opened upstream of ml to allow for easier comparisons. Genes on the outer and inner circumference are transcribed in a clockwise 
and counterclockwise direction, respectively. Gene and corresponding product names are atp6, 8, 9, ATP synthase subunit 6; cob, apocytochrome b; coxl-3, 
cytochrome c oxidase subunits; nadl^, 4L, 5-6, NADH dehydrogenase subunits; ml, ms, large and small subunit rRNAs; A-W, tRNAs, the letter corre- 
sponding to the amino acid specified by the particular tRNA followed by their anticodon. ORFs smaller than 100 amino acids are not shown. 



when compared with the available mtDNAs of Glomus 
spp. and Gigaspora spp. The inflated GC% observed in 
G. cerebriforme is nnostly due to reduced intergenic region 
size and the lack of mobile elements, which are mostly AT- 
rich regions, compared with the other strains. The difference 
in the mtDNA sequences observed between these taxa is 
caused by variation in intron content, dpo insertions and the 



presence of ORFs (table 2). Glomus cerebriforme genes are 
encoded on both strands, their order is completely reshuffled 
compared with the other Glomus spp., and they are also phy- 
logenetically distant from those of G. irregulare (supplemen- 
tary fig. SI, Supplementary Material online). On the other 
hand, Glomus sp. DAOM-240422 showed high similarity to 
G. irregulare with regards to coding sequence identity, but 
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Glomus sp. DAOM-240422 



















■■ 1 






J i u 



1* 

-U. 



86,170 bp 



G. imgulare DAOM-234179 1 



MJLU ■ ™ ■■ I 



75.075 bp 



Glomus cerehriforme 



V ^v' -ii^ c ^ ^ 



nad4 atp6 aip9 




59,633 bp 



B 



G, irregulare DAOM-234179 
atp9-rns 



Glomus sp. DAOM-240422 
atp9-coxl 



G. irregulare DAOM-234179 
nad4L-coxl 



Glomus sp. DAOM-240422 2 
nad4L-ml 



G. irregulare DAOM-234179 
cox3-rnl 



Glomus sp. DAOM-240422 42,100 
cox3-ms 



G. irregulare DAOM-234179 
atp9-rns 



alp9 tmS 
11,775 I I 




atp9 


tmS 






/ rns 


11 775 1 


I 









ATP synthases 
NADH dehydrogenases 
cytochromes oxydases 
transfer RNAs 
ribosomal RNAs 
small inverted repeats (SlRs) 
DNA polymerase (dpo) 
open Reading Frames (ORFs) 



Fig. 2. — {A) Linear genome representation to compare the mitochondrial synteny between Glomus irregulare DAOM-234179, Glomus sp. DAOM- 
240422, and 6. cerehriforme. The linear-mapping genomes were opened upstream of rnl to allow for easier comparisons. Corresponding gene clusters 
between G. irregulare DAOI\/l-234179 and Glomus sp. DAOM-240422 are underlined in red, green, and blue. The newly formed intergenic regions of 
Glomus sp. DAOM-240422 are boxed in gray and tagged with a number. (6) Nucleotide identity comparison with BLASTn between the reshuffled Glomus 
sp. DAOM-240422 intergenic regions {atp9-cox1 [box 1 ], nad4L-rnl [box 2], and cox3-rns [box 3]) with their putative homologous sequence in G. irregulare 
DAOM-234179. Homologous regions are indicated by the projections with their corresponding percent identity. 
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Table 2 

AMF Mitochondrial Genome Features 



AMF Strain 


Genome Size (l<b) 


Introns^ 


ORFs'' 


dpo 


SIRs"^ 


Molecular Features 


G. irregulare'' FACE494 


70,606 


26 


10 


5 


26 





G. irregulare'' DAOM197198 


70,783 


26 


10 


6 


27 





6. irregulare'' MUCL46239 


70,818 


26 


10 


6 


28 





G. irregulare'' MUCL46240 


74,797 


26 


15 


5 


32 


— 


G. irregulare'' MUCL46241 


74,797 


26 


15 


5 


32 




G. irregulare DAOl\/124041 5 


70,781 


26 


9 


8 


7 




G. irregulare DAOM234328 


68,995 


26 


9 


8 


7 




G. irregulare DAOM234179 


75,075 


26 


10 


10 


31 




G. irregulare'' MUCL43204 


87,754 


32 


23 


4 


50 




Glomus sp. DAOM229456 


87,763 


32 


19 


4 


7 


Gene hybrids/HGT 


Glomus sp. DAOM240422 


86,170 ^1 




1 19 


10 


29 


Genome rearrangements ■ 


G. cerebriforme ^ 


59,633 ^1 




L. 4 


2 


40 


Synteny shuffling 


Gigaspora margarita 


96,998 


14 


4 


0 


7 


Transplicing/synteny shuffling 


Gig. rosea 


97,349 


14 


4 


0 


7 


Transplicing/synteny shuffling 



''All types including group I and II introns. 

'^ORFs greater than 100 amino acids encoding unknown protein, LAGLIDADG and GIYYIG endonuclease are listed, including intronic ORFs but not dpo fragments. 
SiRs reported so far in AMF mitochondrial genomes. 

'^Based on data reported in Formey et al. (2012), but eroded ORFs were excluded. 



surprisingly, the synteny of these two isolates differed 
substantially. 

Plasmid-Related DNA Polymerase and Sequence Diversity 
of Small Inverted Repeats 

We found 10 dpo insertions in G. irregulare DAOM-234179 
and Glomus sp. DAOM-240422, making them the most dpo- 
rich AMF isolate reported so far, but only two copies were 
observed in G. cerebriforme. The three isolates harbor a huge 
bona fide dpo with ORF extensions, rendering a structure sim- 
ilar to the mitochondrial plasmid found in other fungi (Kim 
et al. 2000). However, we did not find TIRs, which are usually 
a characteristic of recent plasmid integration. We also found 
numerous SIRs in the three genomes. The 31 SIRs in G. Irre- 
gulare DAOM-234179, 29 in Glomus sp. DAOM-240422, and 
40 in G. cerebriforme were divided into 13, 11, and 5 distinct 
types, respectively, based on their hairpin secondary structure 
(supplementary figs. S3-S5, Supplementary Material online). 
The high number of SIR types, compared with the five previ- 
ously described by Formey et al., result from the characteriza- 
tion of SIRs found in single copy, homologous to those 
previously described that could not have been detected oth- 
erwise. Some types are present in all G. irregulare isolates and 
closely related species such as Glomus sp. DAOM-240422, 
whereas others are endogenous to each strain, giving rise to 
a high SIRs sequence diversity. These elements were always 
present in intergenic regions, introns, or at the edge of endo- 
nuclease-encoding ORFs (supplementary tables SI -S3, 
Supplementary Material online), and suggest a close relation- 
ship with the homing mechanism for endonuclease integra- 
tions, as proposed by Formey et al. (2012). However, the 



diversity of possible cleaving sites seems to be greater than 
previously expected with the here-reported novel mtDNA 
sequences. 

Divergent Synteny in Two Novel Glomeraceae Species 
mtDNA 

The G. cerebriforme mtDNA shows a completely different 
gene order compared with the other AMF mitochondrial ge- 
nomes sequenced so far, with many type 5 SIR insertions in 
intergenic regions that could potentially be involved in synteny 
rearrangements. Nevertheless, as mentioned earlier, only two 
dpo insertions are present, in the cox3-rnl and nad6-rns inter- 
genic regions. As G. cerebriforme is phylogenetically distant 
from the publicly available mtDNAs of Glomus spp. and 
Gigaspora spp., no reference scaffold is available to confirm 
any gene order reorganization. In contrast, the comparison of 
Glomus sp. DAOM-240422 mitochondrial genome with its 
close relative G. irregulare DAOM-2341 79, revealed a reshuf- 
fling of two large gene clusters (fig. 2A). One gene cluster 
encompasses cox7, atp6, nad2, nad3, nad6, and cox3, 
whereas the other contains rns, nadS, cob, nad4, nadi, and 
nad4L. This rearrangement creates three novel intergenic re- 
gions in Glomus sp. DAOM-240422, which are atp9-cox1, 
cox3-rns, and nad4L-rnl. This result was confirmed by long 
PCR performed with primers spanning those regions (supple- 
mentary fig. S2, Supplementary Material online). We then 
assessed the sequence identity between those intergenes 
and their corresponding regions in G. Irregulare DAOM- 
234179 (fig. 2fi). We compared the Glomus sp. DAOM- 
240422 atp9-coxl with G. Irregulare DAOM-234179 
atp9-rns and nad4L-coxl intergene; the cox3-rns with 
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G. irregulare DAOM-234179 cox3-ml and atp9-rns regions; 
and finally the nacl4L-ml was connpared with nad4L-coxl 
and cox3-ml. Each Glomus sp. DAOM-240422 intergenic 
region showed high sequence identity to its putative former 
counterpart in G. irregulare DAOM-2341 79, but covered only 
a small fraction of the intergene length, because many more 
ORFs (mostly dpo insertions) are present in Glomus sp. DAOM- 
240422. Local alignment using BLASTn, between the newly 
formed intergenic regions, did not reveal high similarity 
matches. We also constructed a sequence identity matrix of 
all the dpo sequences in its mtDNA and the highest match was 
only 53.1%. 

Glomeromycota dpo Nucleotide Identity Network and 
Phylogenetic Analyses 

To test whether the dpo sequences in Glomus sp. DAOM- 
240422 were orthologous with the sequences found in 
other AMF and to assess evolution of dpo within the 
Glomeromycota, we constructed a nucleotide similarity net- 
work of all dpo genes reported so far in AMF mitochondrial 
genomes (fig. 3A). In total, 12 similarity groups were formed 
and seven singletons were left aside. We observed that five 
Glomus sp. DAOM-240422 dpo sequences (black circles) clus- 
tered in distinct identity groups (G1, G2, G3, G6, and G8), 
whereas the rest were singletons inserted in reshuffled inter- 
genic regions {cox3-rnl and nad4L-rnl), with low sequence 
similarity to each other or to known AMF dpo sequences. 
The Gig. rosea putative mitochondrial plasmid that was previ- 
ously described (Nadimi et al. 2012) was also one of the sin- 
gletons. The two G. cerebriforme dpo copies (light green 
circles) were the sole representatives of similarity group 12. 
Also, the mtDNA-integrated dpo showed a clustering ten- 
dency toward their intergenic region localization (fig. 36). 
Interestingly, three Glomus sp. DAOM-240422 dpo sequences 
which are inserted in the reshuffled intergenic regions atp9- 
coxl, cox3-rns, and nad4L-rnl (fig. 2) clustered, respectively, 
with the AMF dpo group 8 (cox3-rnl), group 1 (atp9-rns), and 
group 2 {cox3-rnl and the Glomus fasciculatum atp9-rns_3 
sequence). 

To compare the identity network and the phylogenetic ap- 
proach, and to assess the relationship of the dpo present 
within each respective mtDNA (i.e., paralogy vs. orthology), 
we constructed a phylogeny based on a conserved protein 
domain. The alignment of all 91 Glomeromycota dpo protein 
sequences pointed out to a 75 amino acid region that was 
common in 50 of the sequences (supplementary fig. S6A, 
Supplementary Matenal online). Interestingly, most of the 
dpo were truncated either upstream and/or downstream of 
that conserved domain. The resulting maximum likelihood 
phylogenetic tree showed robust bootstrap values (>75%) 
in the distal branches that support clades corresponding to 
the similarity groups observed in the gene network, whereas 
the basal branches displayed were weakly supported 



(supplementary fig. S6S, Supplementary Material online). All 
similarity network groups were represented in the tree with 
strong statistical support in bootstrap and Bayesian inference 
values, except for the identity group 1 1 that was not repre- 
sented, because the sequences did not harbor the conserved 
gene core. Surprisingly, this tree revealed only one occurrence 
of paralogy, in the group 12, for the two dpo sequences in- 
serted in G. cerebriforme mtDNA (like it was also observed in 
the nucleotide network). Every other dpo phylogenetic clus- 
ters represent orthology groups. Finally, only one of the three 
Glomus sp. DAOM-240422 dpo sequences found in 
reshuffled intergenic regions was present in the phylogenetic 
analyses, in the group 8 cluster. 

Glomus sp. DAOM-240422 dpo Recombination Analyses 

Three Glomus sp. dpo genes present in the reshuffled inter- 
genic regions clustered with dpo of other AMF taxa inserted in 
different mtDNA localization, rather than with paralogous 
mtDNA sequences (within DAOM-240422 isolate) in the nu- 
cleotide network. To easily screen for homologous recombi- 
nation, we performed a distance-based recombination 
analyses in which these Glomus sp. dpo sequences were com- 
pared with their three closest relatives in the nucleotide iden- 
tity network. For the Glomus sp. atp9-cox1 and cox3-rns 
(supplementary fig. S7A and B, Supplementary Material 
online) comparison, we observed a disruption in the identity 
of the dpo sequences going from 70% to 80% identity and 
spiking abruptly to a near perfect 100% nucleotide identity. 
When compared together, the Glomus sp. nad4L-rnl_1 dpo 
sequence started off with less than 80% identity to the 
G. fasciculatum atp9-rns_3 dpo and then rose sharply 
to almost 95% identity. Conversely, the G. Irregulare 
MUCL43204_cox3-m/_1 dpo started at more than 99% sim- 
ilar to the Glomus sp. nad4L-rnl_l dpo sequence, but then 
dropped to less than 80% identity (supplementary fig. S7C, 
Supplementary Material online). These dpo sequences 
showed typical signatures of homologous recombinants 
regarding their sequence identity, highly reminiscent to 
what was found for a recombinant HIV-I strain (Liitsola 
et al. 2000). 

To further confirm these findings, we performed a statisti- 
cal phylogenetic recombination analysis using the HMM. The 
Glomus sp. DAOM-240422 atp9-coxl showed a perfect 
Bayesian inference value of 1 .0 for each three topologies in 
a given portion of the alignment (supplementary fig. S7A, 
Supplementary Matenal online), which allowed the finding 
of putative recombination break points (between two given 
topologies where one reach a Bayesian inference value >0.95 
and the other <0.05) at nucleotide position 357 and 613 bp 
of the sequence, respectively. The Glomus sp. DAOM-240422 
nad4L-rnl also showed a perfect Bayesian inference value of 
1 .0, but for only two topologies (I and II), thus allowing for the 
determination of one putative break point at position 161 bp 
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Legend: 



Fig. 3. — Similarity network of dpo sequences inserted in AlVlF mitodiondria! genomes. (A) Eacli node represents a dpo insert colored by mitochondrial 
genome. The node size is proportional to the sequence length. Two nodes are connected by an edge when they share significant similarity (reciprocal best 
BLAST hit with a minimum of IE- 10 score, and 30% minimum identity covering at least 30% of the smallest sequence). The layout was produced by 
Cytoscape, using an edge-weighted force-directed model, which brings closer sequences sharing more similarity. Twelve homology groups were formed 
{labeled G1-G12). There are 97 nodes on that network with 336 edges. (B) Same network but nodes are colored by their mitochondrial intergenic region 
localization. The legend shows a linear size relationship between the smallest dpo sequence {Glomus irregulare DAOM-24041 5_cox3-m/_4, 1 29 bp) and the 
largest one (G. cerebriforme atp6-atp9, 3444 bp). 
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(supplennentary fig. 7C, Supplennentary Material online). 
Conversely, the Glomus sp. DAOM-240422 cox3-ms analyses 
did not provide statistical support for any given topologies 
tested with the HMM phylogenetic method (supplementary 
fig. 7B, Supplementary Material online). 

Global Protein Similarity Network with All Known DNA 
Polymerase Genes in the Public Database 

To assess a possible origin and transmission mode of AMF 
mitochondrial dpo sequences, we constructed a protein 
similarity network using a broad spectrum of dpo sequences. 
Interestingly, most of the glomeromycotan plasmid-related 
DNA polymerase proteins, 60 in total, were found clustered 
with 35 other fungi, 28 plants (including two red algae of the 
genus Porphyra), five other eukaryotes {Physarum polycepha- 
lum, Ochromonas danica, two Placozoan sp. and a Cnidaria 
species), four bacteria {Bacillus thuringiensis, two Bacillus 
cereus, and Brevibacillus sp.), and three bacteriophages 
(fig. 4/4). All dpo sequences are present within their respective 
mtDNA or are located in a mitochondrial plasmid. The patchy 
distribution observed among phylogenetically distant taxa 
such as plants and fungi is similar to that of a coxl intron 
that invaded plant mitochondria and was thought to have 
been acquired horizontally from a fungal donor (Cho et al. 
1998). To assess the evolutionary relationship of AMF DPO 
proteins with the other taxa present in the cluster, we selected 
the group gathering the most connected nodes, that is, the 
sequences sharing the most similarity all together. The corre- 
sponding sequences (including plant, Glomeromycotan and 
other fungal sequences), were used to perform a maximum 
likelihood phylogenetic analyses (fig. 46). We used the viral 
and bacterial proteins present in the sub-network to root the 
tree, because they present a distal position in the network. The 
resulting tree showed robust support (100% bootstrap and 
1 .0 Bayesian inference value) for the viral-bacterial group, and 
surprisingly, the plant DPO proteins clustered with those of the 
Glomeromycota as a sister group, supported by a 76% boot- 
strap and 1 .0 Bayesian inference value. Except for the other 
fungi group clade, the phylogeny was robustly supported 
either for the basal and the distal branches. 

Discussion 

Interhaplotype Homologous Recombination in Glomus 
sp. DAOM-240422 

The numerous type 5 SIRs present in G. cerebriforme inter- 
genic regions may have had the potential to trigger intra- 
molecular gene reshuffling. The two G. cerebriforme dpo 
sequences were shown to be paralogs, with a nucleotide iden- 
tity of 81 %. However, we cannot be sure that intra-molecular 
recombination occurred in G. cerebriforme, because we do 
not have closely related taxa as comparators. In the Glomus 
sp. DAOM-240422 mitochondrial genome, the two most 



similar dpo sequences share a low nucleotide identity percent- 
age of 53.1 %. However, we observe a near perfect sequence 
identity for PCGs and some intergenic regions with close rel- 
ative G. irregulare. Taken together, these observations suggest 
that the rearrangements observed in Glomus sp. DAOM- 
240422 are recent and do not result from intramolecular 
recombination from an ancient dpo duplication. Further, no 
recent dpo duplications were observed in any given AMF 
strains surveyed. The Glomus sp. DAOM-240422 dpo se- 
quences that are inserted in the reshuffled intergenic regions 
clustered in the nucleotide network with dpo sequences of 
other AMF species that were inserted in the putatively former 
intergenic arrangement. Comparison of the Glomus sp. 
DAOM-240422 atp9-coxl and nad4L-rnlJ dpo, with their 
three closest relatives showed a typical signature of homolo- 
gous recombination (supplementary fig. S7A and C, 
Supplementary Material online), with genes sharing a high 
sequence identity with different portions of the alignment, 
where intersections represent putative recombination break 
points. These recombination events were strongly statistically 
supported with the HMM method. Even if the Glomus sp. 
240422 cox3-rns dpo recombination was not supported by 
HMM, it still clustered with dpo present in other intergenic 
regions, and the lack of statistical support might be explained 
by the absence of its recombination partner in the cluster. 
These observations strongly suggest that the mitochondrial 
gene cluster reorganization in Glomus sp. DAOM-240422 is 
the result of dpo-mediated homologous recombination be- 
tween different mitochondrial haplotypes harboring highly 
similar dpo orthologs present in distinct genome localization. 

Two types of mobile elements reported in AMF mitochon- 
drial genomes have been previously shown to be involved in 
genome rearrangements through homologous recombination 
in other organisms: the plasmid-related dpo genes in 
Basidiomycota (Ferandon et al. 2008) and SIRs in bacteria 
and plants (Schofield et al. 1992; Bi and Liu 1996; Tanaka 
et al. 2012). Furthermore, the fact that the number of dpo 
sequences varies in different intergenic regions among isolates 
of the same species and closely related ones (Formey et al. 
201 2; Beaudet et al. 201 3), combined with our findings sug- 
gest that 1) there is no evidence of intra-molecular recombi- 
nation in Glomus sp. DAOM-240422; 2) recombination 
between distinct mtDNA haplotypes harboring different dpo 
insertions could occur; 3) as there is no intra-isolate mtDNA 
polymorphism, the reshuffled mtDNA haplotype in Glomus 
sp. DAOM-240422 is fixed in the population; and 4) those 
rearrangements are triggered by dpo insertions through 
homologous recombination. 

Such recombination events could take place following 
hyphal fusion (i.e., anastomosis). This last feature not only 
facilitates the distribution of nutrients and signaling molecules 
through the entire AMF mycelium, but also plays a key role in 
genetic exchange (Giovannetti et al. 1999) and segregation 
(Angelard and Sanders 201 1). Those anastomoses can occur 
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J Glomus sp. DAOM229456 cox3-mLl 
1 Glomus irregulare MUCL43204 AFN42486 

Glomus fasciculatum DAOM240159 atp9-rns_4 
Glomus rrregulare MUCL43204 AFN42484 
Glomus irregulare DAOM240415 cox3-rnl_5 
Glomus sp, DAOM240422 atp9-coxLl 
Glomus fasciculatum DAOM240I59 cox3-mL4 
Glomus irregulare MUCL46241 cox3-mLl 
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Fig. 4. — {A) Sub-cluster of the global network of shared amino acid similarities between the Glomeromycota DPO proteins and all homologous 
sequences found on GenBank. Building this data set was performed using translated Glomeromycota sequences as queries for a BLASTp search on the 
GenBank nr database (minimal e-value threshold of 1 E-40, 20% minimal similarity covering at least 20% of the smallest sequence). The network layouts 
were further produced by Cytoscape software, using an edge-weighted force-directed model, meaning that genes sharing more protein similarity appear 
closer in the display. There are 135 nodes in that network with 2,520 edges. (B) DPO protein maximum likelihood tree obtained with the rtREV-FCAT 
phylogenetic model. The bacterialA/iral cluster was used to root the tree. Number on branches indicates bootstrap support values (<60% cut-off) and 
Bayesian inference values, respectively. The Bayesian analyses gave a similar topology (data not shown). The tree includes AMP (black), other fungi (orange), 
plants (green), virus (red), and bacteria (blue). 
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either between genetically different AMF isolates originating 
from the sanne experimental field (Croll et al. 2009; Angelard 
et al. 2010) or between strains isolated from distant locations 
(e.g., from the same [Purin and Morton 2012], or even differ- 
ent continents [de la Providencia et al. 2013]). Both of these 
studies suggested that mitochondria might also be exchanged 
via anastomosis, thus creating a heteroplasmic state. 

Evidence of Heteroplasmy Challenges the Concept of 
an AMF Individual 

All published AMF mitochondrial genomes are homoplasmic, 
that is, homogeneous within isolates. In other fungi, mech- 
anisms contributing to homoplasmy are often related to the 
recombination of parental mtDNAs or by the selection of one 
of the parental haplotypes due to the presence of segrega- 
tion mechanisms (reviewed by Birky [2001]). In the absence 
of external factors, heteroplasmy should be the default state 
for mtDNA under a simple mutation-drift scenario (White 
et al. 2008). In yeasts, the presence of a heteroplasmic 
state was shown to be transient due to the presence of seg- 
regation mechanisms, such as nucleoid formation, and 
mtDNA molecular repair pathways which maintain mito- 
chondrial genome integrity (Zimmer et al. 1991; Hu et al. 
1995; Yasuhira and Yasui 2000). The presence of nucleoids, 
similar mtDNAs linked by Holliday junctions, induces a ge- 
netic bottleneck responsible for a fast segregation (Lockshon 
et al. 1995; White and Lilley 1997; MacAlpine et al. 1998). 
The mitochondrial genome homogeneity observed for in 
vitro and in vivo AMF cultures could be the result of such 
effective mtDNA segregation and repair mechanisms that 
would take place during the subcultivation processes. 

The AMF colonies are characterized by the interconnected- 
ness of its different parts by means of hyphal fusion (i.e., anas- 
tomosis) (Giovannetti et al. 1 999; de la Providencia et al. 2005; 
Voets et al. 2006; Purin and Morton 2012), which can lead to 
genetic exchange, even between genetically distinct individ- 
uals. This hyphal fusion dynamic in natural populations can 
also prevent the lost of genetic diversity (Bever and Wang 
2005). Recently, anastomoses between geographically distant 
G. irregulars isolates cultivated in vitro were shown to induce 
transient mitochondrial length heteroplasmy (with variation in 
dpo insertions between the two mitochondrial haplotypes) (de 
la Providencia et al. 201 3). Although in situ heteroplasmy have 
never been demonstrated, these observations, in addition to 
the interhaplotype dpo homologous recombination evidences 
presented here, suggest that transient heteroplasmy could 
occur in the field. 

The coexistence of numerous AMF mitochondrial haplo- 
types in the same cytoplasm and the occurrence of homolo- 
gous mitochondrial recombination might be commonplace in 
nature, as it was shown in natural populations of the basid- 
iomycete fungus Armillaha gallica (Saville et al. 1998). The 
heteroplasmy stability in natural populations, may provide 



useful additional information for defining haplotypes, and re- 
solving further the relationships among individuals at a pop- 
ulation level (White et al. 2008). Given the increasing use of 
mitochondrial markers in AMF research, the demonstration 
that genetically distinct mtDNAs can be exchanged through 
anastomosis, coexist in a common cytoplasm and give rise to 
a recombinant haplotype, should lead to a thorough discus- 
sion on the AMF individuality concept. If heteroplasmy is 
confirmed in natural populations, the use of mtDNA as a cri- 
terion to define a reliable AMF taxonomic unit would certainly 
facilitate our understanding of the processes occurring in nat- 
ural populations, although caution should still be applied be- 
cause it would be artificial, and would not reflect the 
genotypic diversity of AMF nuclear DNA, because mitochon- 
drial and nuclear DNA segregation are mostly stochastic 
mechanisms. 

Vertical Versus Horizontal Inheritance of the AMF 
Mitochondrial dpo 

Given the high genetic diversity of dpo and their variable trun- 
cation patterns, a classical phylogenetic approach requiring 
multiple alignment of highly similar sequences, is not suitable 
to analyze their overall evolutionary relationships, especially 
for recombinant ones (Bapteste et al. 2012). Indeed, the clas- 
sical approach only allowed us to analyze 60 of 91 
Glomeromycota dpo sequences, on a limited 75 amino acids 
conserved domain, leading to a tree with poorly supported 
basal branches (supplementary fig. 66, Supplementary 
Material online). As an alternative, gene similarity network 
analyses have already been proven to be useful in the study 
of highly diverse gene families and introgressive descent 
events, such as recombination and HGT (Bapteste et al. 2012). 

The AMF dpo nucleotide network highlights the existence 
of a clustering tendency for dpo sequences located in the 
same intergenic regions (fig. SB). The high similarity (short 
edges) observed between the sequences of each group 
and the phylogenetic topology (supplementary fig. S66, 
Supplementary Material online) suggest that they are ortho- 
logs that could have recently been acquired vertically from a 
common ancestor. This is not surprising for the G. irregulare 
isolates, but implies that the other species that shared those 
sequences have also diverged recently (i.e., G. fasciculatum 
and G. aggregatum) given the high within-group sequence 
similarity. However, the seven singletons left aside by the net- 
work nucleotide similarity clustering (five in the closely related 
species Glomus sp. DAOM-240422) suggest that numerous 
independent plasmid-mediated dpo insertions, from un- 
known origin, and/or followed by a fast divergence might 
also have occurred. The putative Gig. rosea plasmid had low 
sequence similarity with the other dpo insertions in AMF 
mtDNA and no extra chromosomal dpo were found within 
any of the surveyed AMF genomes. This dpo sequence 
diversity could indicate that AMF mitochondria harbor a 
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broad range of plasmid types, as described in Neurospora 
species, where seven circular and four linear homology 
groups of mitochondrial plasmids have been reported 
(Schulte and Lambowitz 1991; Yang and Griffiths 1993a, 
1993b; Arganoza et al. 1994; Marcinko-Kuehn et al. 1994). 

It has been found that mitochondrial plasmids in plants 
could be horizontally acquired through pollen (Handa 2007). 
HGT has also been hypothesized to be an important factor in 
the dispersal of mitochondrial plasmids between fungal strains 
and species. That assumption is based on 1) discordant phy- 
logenetic relationships between plasmids and host genomes 
(Kempken et al. 1992), 2) the geographic distribution of plas- 
mids within and among host species (Arganoza et al. 1994), 
and 3) direct experimental demonstration of HGT (Collins and 
Saville 1990; Griffiths et al. 1990; Debets et al. 1994). Until 
now, no direct experimental evidence of HGT of dpo se- 
quences was provided, and no mitochondrial plasmid has 
yet been found in AMF (except a putative linear plasmid in 
Gig. rosea). However, the distribution of the plasmid-related 
dpo sequences is uneven between isolates of the same AMF 
species (Formey et al. 2012) and we showed that there is a 
large diversity of dpo sequences within AMF (12 orthology 
groups, along with seven singletons). Another intriguing ob- 
servation is that the dpo sequences are also present in all AMF 
family surveyed including the Glomeraceae, Gigasporaceae, 
Ambisporaceae, Diversisporaceae, and Archaeosporaceae 
(data not shown, based on a preliminary 454 assembly). A 
mechanism known as post-fusion incompatibility might be 
the most plausible explanation for the ubiquitous distribution 
of AMF mtDNA dpo. This mechanism implies a brief hyphal 
fusion between incompatible AMF species and the formation 
of septa. This process has already been shown to allow ex- 
change of nuclear genetic material (Croll et al. 2009), and also 
offers an explanation for the mobile endonuclease-mediated 
HGT, leading to the formation of gene hybrids in the mito- 
chondrial genome of Glomus diaphanum-Wke species 
(Beaudet et al. 2013). Keeping in perspective that divergent 
AMF taxa can simultaneously colonize roots of the same plant 
species, it is likely that a mitochondrial plasmid could spread 
rapidly through the Glomeromycota phylum even between 
phylogenetically distant taxa. 

Glomeromycota dpo Are Closely Related to Plant 
Mitochondrial Plasmids 

More than 50 linear plasmids have been reported in the mi- 
tochondria of 20 fungal species, in contrast to the 1 4 found in 
eight plant species. These have only been observed twice in an- 
imal mtDNA, in the placozoan Trichoplax adhaerens and in the 
moon \e\\y Aurelia aurita (Shao et al. 2006; Signorovitch et al. 
2007). This uneven distribution of mitochondrial plasmids 
within the Eukaryota raises questions about their origin and 
their mode of transmission. 



The structure of the plant mitochondrial plasmids consists 
of an invertron structure, long inverted repeats with proteins 
covalently bound to their 5'-ends. This structure is remarkably 
similar to fungal plasmids and to some DNA viruses (Rosewich 
and Kistler 2000). Additional evidence, such as low GC con- 
tent in plant plasmids (30% in the Brassica 1 1.6kb plasmid, 
38.9% in the 10.4kb sugar beet plasmid, and 37-39% in the 
maize S plasmids) compared with their respective mtDNA 
{Brassica napus 45.2% [Handa 2003], Beta vulgaris 43.9% 
[Kubo et al. 2000]) and the maize 44.0% (GenBank accession 
DQ490951), supports HGT from a fungal donor as the 
source of linear mitochondrial plasmids in plants, because 
low GC content is typically found in fungal mtDNA (Handa 
2008) (with an average of 37% in G. irregulare isolates 
[Handa 2008; Formey et al. 2012]). Moreover, our phyloge- 
netic analyses robustly support the DPO proteins of plants 
and Glomeromycota as a sister group. Taken together, 
these evidences suggest that AMF could be potential donors 
of plasmid-related dpo to plants. However, species of 
Glomeromycota colonize the same ecological niche and 
often co-occur with many pathogenic or endophytic fungi in 
plant roots, and one of these co-occurring fungi could be the 
common donor to both plants and AMF, also plant plasmids 
could have different fungal origins. If this is the case, the 
organism in question has yet to be sequenced. 

To further study the AMF mitochondrial plasmid-related 
dpo genes, the mitochondrial genome of Geosiphon pyrifor- 
mis, an ancestral member of Glomeromycota that lives in 
endocytobiotic association with the cyanobacterium Nostoc 
punctiforme (Gehrig et al. 1996), could provide insights into 
the origin and evolution of those elements in that phylum. 
Also, some DNA polymerase genes were present in the 
mtDNA of the ascomycete fungal partner of the lichen 
Peltigera malacea (Xavier et al. 2012). Those sequences clus- 
tered within the other fungi group in the global DPO protein 
network. Another interesting avenue would be to assess the 
horizontal transmission of mitochondrial plasmids in different 
fungi-plant symbiotic systems. 

Conclusions and Outlook 

The mitochondrial genome comparison of the three Glomus 
species showed the extent of mtDNA plasticity with regards 
to their synteny. It also offers a basis to develop molecular 
tool kits aimed specifically at identifying and quantifying taxa 
of the ecologically and agriculturally important Glomeraceae 
family. However, finding evidence of interhaplotype homol- 
ogous recombination suggests the occurrence of mtDNA 
heteroplasmy in natural populations. Although this has yet 
to be demonstrated, it raises questions about the artificiality 
of single spore cultivations of these fungi, which probably 
induces an underestimation of the mtDNA diversity in situ. 
Approaches like genotyping-by-sequencing and ecotilling 
could be useful to assess this particular hypothesis. 
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Mitochondrial genome comparisons can also provide insights 
into molecular processes related to the biology and commu- 
nity structure of AlVlF. The close phylogenetic relationship 
between the plasmid-related dpo of plants and Glomeromy- 
cota suggests that HGT events could have occurred between 
plants and their fungal symbionts. The intimate relationship 
between plants and AMF in nature may unravel other trans- 
fer events that have taken place through evolution. Also, the 
role of viruses and bacteria and their interactions with fungi 
and plants represents an interesting research avenue to gen- 
erate knowledge about the evolution and origin of mito- 
chondrial plasmids in those groups. 

Supplementary Material 

Supplementary tables S1-S3 and figures S1-S7 are available 
at Genome Biology and Evolution online (http:/AAAAAA/.gbe. 
oxfordjournals.org/). 
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